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Mean telomere length (TL) in blood cells is heritable and has been reported to be associated with risks of several 
diseases, including cancer. We conducted a meta-analysis of three GWAS for TL (total /i=2240) and selected 
1629 variants for replication via the "iCOGS" custom genotyping array. All ^200 000 iCOGS variants were ana- 
lysed with TL, and those displaying associations in healthy controls (n = 1 5 065) were further tested in breast 
cancer cases (n = 1 1 024). We found a novel TL association (Ptrend < 4 x 10~^°) at 3p14.4 close to PX/Cand evi- 
dence (Ptrend < 7 X 10"^ for TL locI at 6p22.1 (ZNF311) and 20q11.2 (BCL2L1). We additionally confirmed 
(Ptrend < 5 X lO"^"*) the provlously reported loci at 3q26.2 (TERC), 5p15.3 {TERT) and 10q24.3 (0BFC1) and 
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found supportive evidence (Ptrend < 5 x 10"'*) for the published loci at 2p16.2 (ACYP2), 4q32.2 (NAF1) and 
20q13.3 (RTEL1). SNPs tagging these loci explain TL differences of up to 731 bp (corresponding to 18% of 
total TL in healthy individuals), however, they display little direct evidence for association with breast, ovarian 
or prostate cancer risks. 



INTRODUCTION 

Human chromosomes are capped and stabilized by telomeres, 
which are predominantly composed of several thousand DNA 
hexamer repeats (1-3). Mean telomere length (TL) is strongly 
heritable (4-6), but telomeres also shorten with each cell division, 
and consequently with age (7-11). Thus, shorter telomeres 
(assayed as mean length in DNA from blood cells) have been 
hypothesized to predispose to a number of diseases of aging, in- 
cluding certain cancers. Attempts to test this hypothesis in blood 
drawn after cancer diagnosis do indeed show cases to have 
shorter telomeres than their study-matched controls (12-14). 
However, prospectively designed studies, which use stored 
blood samples collected prior to cancer diagnosis, have failed to 
confirm this hypothesis (13,15-18). Such prospective designs 
are better able to address the question of whether telomeres 
become shorter before cancer diagnosis, and are thus predictive 
of cancer development. Another way of addressing the same 
hypothesis is to identify genetic variants that are associated with 
differences in TL in blood and then examine the associations of 
these same variants with cancer risks, both directly and in Mendel- 
ian randomization studies (19-22). If shorter telomeres are direct- 
ly responsible for increased cancer risk, then genetic variants that 
reduce TL should also increase cancer risk and, given appropriately 
powered studies, this effect should be detectable. 

Previously, several small GWAS with TL, as well as a meta- 
analysis (« = 12 000 subjects), have been published (23-26). 
More recently, a meta-analysis of over 37 000 individuals found 
seven loci affecting TL, and a fine-mapping study carried out in 
parallel to this work (and using the same individuals) has found 
variants associated with TL at the TERT locus (coding for the 
protein subunit of telomerase, a complex with an integral role 
in telomere maintenance) (27,28). Together, all these publica- 
tions have identified variants at nine loci which are significantly 
associated with TL: TERC (3q26.2) (23,25-27) (the RNA sub- 
unit of telomerase), TERT {5^15 .33) (27,28), OFBCl (10q24.3) 
(24,26,27), ACYP2 (2pl6.2) (27), NAFl (4q32.2) (27), ZNF208 
(19pl2) (27), RTELl (20ql3.3) (27), CTCl (17pl3.1) (26) and 
ZNF676 {\9^\2) (26). 

The lUumina^'^ "iCOGS" custom genotyping chip (28) was 
designed as a collaborative project, principally involving four 
international consortia; the Breast Cancer Association Consor- 
tium (BCAC) (29) (http://ccge.medschl.cam.ac.uk/consortia/ 
bcac/), the Ovarian Cancer Association Consortium (OCAC) 
(30) (http://ccge.medschl.cam.ac.uk/consortia/ocac/), the Con- 
sortium of Investigators of Modifiers of BRCAl/2 (CIMBA) 
(3 1 ,32) (http://ccge.medschl.cam.ac.uk/consortia/cimba/index. 
html) and Prostate Cancer Association Group to Investigate 
Cancer Associated Alterations in the Genome (PRACTICAL) 
(33,34) (http://ccge.medschl.cam.ac.uk/consortia/practical/). 
This chip assays ~200 000 SNPs chosen by many researchers 
with different study aims. More than 200 000 participants in 



many cancer studies have already been typed using this chip, 
26 089 of whom TL had also been determined. 

Our aim, made possible by the Collaborative Oncology 
Gene-environment Study (COGS) project, was to carry out a 
larger GWAS than those currently published and test any con- 
firmed TL variants for additional associations with breast, 
ovarian and prostate cancer risks. 



RESULTS 

GWAS for mean TL in blood cells 

A Manhattan plot of all 187 647 SNPs on the custom "iCOGS" 
chip in all 26 089 participants (healthy controls and cancer 
cases) in whom TL was measured is shown in Figure 1. Full 
results of the marked 'best SNP per peak' analyses are given in 
Table 1. SNPs at four loci were associated at -Ptrend < 10~^ in 
the analysis of healthy controls and these all increased in signifi- 
cance on inclusion of the cancer cases. The most significant of 
these mapped to the 3q26.2 locus containing the TERC gene 
(rsl3 17082, Arend = 1 X 10"^''). This lead SNP had an effect 
equivalent to a TL 77 bp shorter for every minor allele carried. 
The second locus at 5pl5.3 contains the functionally related 
TERT gene (rs7726159 Arend = 5 x 10"'^) with a similarly 
sized per-allele effect to the TERC SNP, albeit with the minor 
allele associated with longer telomeres. The third most signifi- 
cant locus, at 10q24.3, contains the OBCFl gene. Here, the 
minor allele of the lead SNP (rs2487999, Arend= 4 x 10"'"*) 
has a lower frequency (MAF = 0.10) but a greater effect size, 
a TL 100 bp longer for every minor allele carried. The fourth 
locus at 3pl4.4, containing the PXK gene, is a novel finding 
(lead SNP rs6772228, Ptrend= 4 x 10~'") and has the greatest 
effect on TL, equivalent to 120 bp shorter TL per-allele. We 
also note a further two, previously unreported loci, containing 
SNPs displaying evidence for association with TL: 6p22. 1 con- 
taining the ZNF31I gene (lead SNP rs9257445, Arend = 1 x 
10"^) and 20qll.2 containing BCL2L1 (lead SNP rs6060627, 
/"trend = 6 X 10~^), with per-minor-allclc effects equivalent to 
a 38 bp shorter TL and a 36 bp longer TL, respectively. In add- 
ition to these six loci, SNPs at the DMRTl locus (9p24.3) 
showed an association with TL at Arend = 1 x I0~^ in the 
control-only analysis, but did not replicate upon addition of the 
data from cancer cases (case and control analysis, Arend = 2 x 
10~^; Supplementary Material, Table SI). 

Codd et al. (27) reported significant TL associations with four 
further SNPs which were not present on the iCOGS chip. 
However, good surrogates for three of these were on the chip 
and the results for these are shown in Table 3 . The best surrogate 
SNP for the reported 2pI6.2 lead variant (rsl 1125529) was 
rsl0165485 (pairwise r = 0.98) (35), and we find the minor 
allele of rsI0165485 to be associated with longer TL (Arend = 
2.4 X 10~^) in contrast to the published finding of a per-allele 
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Figure 1. Manhattan plot of all iCOGS SNPs and TL in all 26 089 participants (healthy controls and cancer cases) from the CCHS, CGPS and SEARCH studies. Solid 
squares represent the negative log of the per-risk-allele Pirend against the genome position of each SNP. The lead SNPs in each of the peaks are marked with a larger 
square and the SNP name and gene region noted. Black open circles mark the results (in 26 089 COGS cases and controls) for the top SNPs originally selected from the 
GWASmeta-meta-analysis(chr3, rsl0936601,P„end= 3.6 x 10"'^; chr3, rsl 1709840, Ptrend = 6.5 x 10""* andchr4, rs930306,Ptrend = 8.4 x 10"^). 



decrease in TL for rsl 1 125529. The best sun^ogates for the pub- 
lished lead SNPsat4q32.2(rs7675998)and20ql3.3 (rs755017) 
are iCOGS SNPs rs2320615 and rs2738783 [r = 0.89 and 1.0, 
respectively (35)] which both have iCOGSArend < 5.0 x 10""^. 
There were no good surrogates for the 19pl2 locus lead SNPs on 
iCOGS. 

Proportion of variance in mean TL explained 

Least-squares regression of the unranked genotype on TL resi- 
duals (after regression with age, gender, case -control status 
and study) led to estimates that SNPs tagging loci associated 
with TL at Arend < 7 X 10~^ individually explained from 0.06 
to 0.2% of the total variation in TL and had an F-statistics 
from 7-33 (Table 1). This indicates that these SNPs are likely 
to be valid instruments for Mendelian randomization studies 
as an F-statistics > 10 is considered to be a valid instrument 
(36). The distribution of the allele scores — the sum of the 
number of 'shorter telomere' alleles, be they the minor or 



major allele — with these six SNPs approaches a normal distribu- 
tion and mean TL decreases considerably with increasing 
numbers of shorter TL-associated alleles, both for cases and con- 
trols (Fig. 2). 



Associations with cancer risks 

We examined the top SNP from each of our confirmed TL loci for 
additional associations with risks of all hormone-related cancers 
among the consortia participating in COGS (Table 2). All ana- 
lyses were confined to participants of European ancestry, thus 
we examined breast cancer risk in 41 BCAC studies (« = 89 
050 cases and controls), ovarian cancer risk in up to 34 OCAC 
studies (« = 39 774), of whom 8372 cases had serous epithelial 
ovarian cancer and 979 had serous low malignant potential 
(LMP) ovarian neoplasias and prostate cancer risk in 32 
studies participating in the PRACTICAL Consortium (w = 44 
620) (29,30,32). 
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Table 1. Association between iCOGS SNPs, TL and cancer risk 



Region 


Chr 


SNP 


Chr Position 
(Build 36) 


MAP 


TL association 

CCHS and SEARCH 

controls (« = 15 065) 
p 

^ trend 


TL association 
CCHS, CPGS and SEARCH 
cases and controls (n = 26 089) 
Per-allele ATL (95% CI) /^t^nd 




(%) 


F-statistic 


TERC 


3 


rsl317082 


170 980 279 


0.25 


1.04E-13 


-77 (-57 to -98) 


1.33E- 


19 


0.2 


30 


TERT 


5 


rs7726159 


1 335 319 


0.34 


6.10E-12 


73 (55 to 92) 


4.67E- 


17 


0.3 


33 


OBFCl 


10 


rs2487999 


105 649 816 


0.10 


7.30E-09 


100 (70 to 129) 


4.22E- 


14 


0.2 


23 


PXK 


3 


rs6772228 


58 351 059 


0.05 


7.77E-08 


-120 (-83 to -158) 


3.91E- 


10 


0.2 


23 


ZNF3J1 


6 


rs9257445 


29 057 185 


0.25 


3.43E-05 


-38 (-17 to -58) 


1.38E- 


07 


0.06 


7 


BCL2L1 


20 


rs6060627 


29 725 820 


0.30 


8.33E-04 


36 (17 to 55) 


6.45E- 


07 


0.06 


7 



Single SNP estimates for the lead variants from each of the six strongest association peaks in the meta-analysis of all case and control participants (« = 26 089) for 
association with TL are given in Table 1 (see also Fig. 1 , and for study-by-study estimates for the three control and two case populations, Supplementary Material, 
Table S2). The difference in TL per-minor-allele is expressed as the change in TL (ATL), with mean TL (95% CI), using the common homozygote as reference (see 
Supplementary Material) for the case and control analysis. Statistical significance (/"trend) is also given for the healthy control only (« = 15 065) analysis (see also 
Supplementary Material, Table S 1 ). and F-statistics are obtained from the least-squares regression of the unranked genotype on TL residuals (after regression with 
age, gender, case-control status and study). All cancer risk associations are presented as ORs with 95%i CI and per-allele Ptrend (Table 2). Ovarian cancer risks are 
broken down into serous invasive and serous LMP subgroups. MAP: minor allele frequency. 



None of the six SNPs tested displayed strong evidence for as- 
sociation with cancer, with the exception of TERT variant, 
rs7726159, which was significantly associated with risks of all 
three cancers [breast cancer odds ratio (OR) = 1 .05 [95% confi- 
dence intervals (95% CI)] 1.03-1.07,Arend = 2 X 10"^; serous 
invasive ovarian cancer 0R= 1.12 (95% CI 1.09-1.16), 
Arend=3 X 10~^; scrous LMP ovarian cancer 0R= 1.45 
(95% CI 1.36-1.55), Arend = 2 X 10""*; prostate cancer 
OR = 0.88 (95% CI 0.85-0.91), Arend = 2 x 10"'**]. The 
only other significant association, allowing for multiple 
testing, was OBFCl SNP rs2487999 with serous LMP non- 
invasive ovarian cancer risk Arend = 6 x 10"^, although the 
number of cases here (« = 979) was relatively small compared 
with the other case-control comparisons. 



DISCUSSION 

From our analyses, we find four loci that are associated with TL 
at genome -wide significance levels; the previously reported 
TERC (3q26.2), TERT (5^15. 2:2>) and OFBCl (10q24.3) loci, 
and a novel locus — PXK on 3pl4. 1 . We also find evidence for as- 
sociation at ZNF311 (6p22. 1) and BCL2L1 (20ql 1 .2), but these 
have not yet reached the accepted significance levels for 
genome-wide studies. The recent publication by Codd et al. 
(27) reported seven loci associated with TL and here we find 
genome-wide significance levels for three of these regions: 
TERC, TERT and OBFCl and supportive evidence (Arend < 
5.0 X lO""*) for a further three: ACYP2 (2pl6.2), NAFl 
(4q32.2), RTELl (20ql3.3). However, at 2pl6.2 we found the 
minor allele of suiTogate SNP (rsl0165485) to be associated 
with longer TL in contrast to the published rsl 1 125529 associ- 
ation with shorter TL. We did not have any good surrogate for 
the reportedly associated SNP at 19pl2 {ZNF208) and so could 
not test this. We could not confirm previously reported associa- 
tions (26) at CTCl (17pl3.1) and ZNF676 (19pl2) (data not 
shown). It should be noted that although the ZNF208 and 
ZNF676 loci are only ~ 350 kb apart, their lead SNPs are uncor- 
related [pairwise = 0.002 (35)]. 



In our own GWAS, we observed 272 SNPs with Ptrend < 1 0 
among healthy control subjects, significantly more than the 
expected number of 184. Eighty-six of these 272 variants 
became more significant upon addition of the data from cancer 
cases, demonstrating that despite having overall differences in 
TL, when compared with healthy controls, cancer cases continue 
to display consistent genetic associations with TL. In the entire 
set of 26 089 participants (analysis of both healthy controls 
and cancer cases), 595 SNPs displayed Arend < 10"^, a 3-fold 
excess over the 194 variants that would have been expected by 
chance. Of the 1558 SNPs nominated from the GWAS 
meta-analysis, three displayed Ptrend < 10""* in the total 26 
089 'Cases and Controls' analysis, including two SNPs at the 
TERC locus (marked on Fig. 1), and a further seven SNPs dis- 
played Arend < 10"^ (data uot showu). We note that none of 
the SNPs most significantly associated with TL at any of the 
loci were identified from the initial TL GWAS meta-analysis. 
This is evidence that our initial study was underpowered, and 
that future larger studies will continue to confirm the existence 
of many more TL-associated loci. 

Until fine-scale mapping has been completed, we cannot be 
certain of the functional gene at each locus. The most significant 
SNP at 3pl4.1, rs6772228, is in intron 4 of the PXATgene. This 
codes for a serine/threonine kinase involved in the regulation 
of electrical excitability and synaptic transmission, a seemingly 
unlikely candidate for involvement in telomere biology. It 
should also be noted that, from the study-by-study results 
shown in Supplementary Material, Table S2, the PXK associ- 
ation seems driven by the SEARCH cases and controls. The 
top SNP at 6p22.1 is 13 kb upstream of ZNF311, a gene which 
codes for a zinc finger protein that may work as a transcription 
regulator. The most significant SNP at 20ql 1 .2 is in intron 3 of 
the BCL2L1 gene, which belongs to the BCL-2 family of pro- 
teins, integral in the regulation of apoptosis. All six loci are 
being fine-mapped, with all SNPs correlated with the lead SNP 
in each region (r^ > 0.3) submitted for genotyping on another 
upcoming collaborative custom chip. 

From our data on the lead SNPs in these six most significant 
loci, the TL-altering alleles have additive effects in healthy 



Table 2. Association between iCOGS SNPs, TL and cancer risk 



Region 


Chr 


SNP 


Chr Position 


MAF 


Breast cancer association 






Ovarian cancer association 




Ovarian cancer association 




Prostate cancer association 










(Build 36) 




Overall risk (BCAC) 






Serous invasive (OCAC) 






Serous LMP (OCAC) 






Overall risk (PRACTICAL) 














46 451 cases, 42 599 controls 




8371 cases, 23 444 controls 




979 cases, 17 869 controls 




22 297 cases, 22 323 controls 














Per-allele OR (95% CI) 


P 

^ trend 




Per-allele OR (95% CI) 


p 

^ trend 




Per-allele OR (95% CI) 


p 

trend 




Per-allele OR (95% CI) 


p 

^ trend 




TERC 


3 


rs 13 17082 


170 980 279 


0.25 


0.99 (0.97-1.01) 


3.31E 


-01 


1.03 (0.99-1.08) 


1.31E- 


01 


0.89(0.78-1.00) 


4.61E 


-02 


0.96 (0.93-0.99) 


6.16E- 


03 


TERT 


5 


rs7726159 


1 335 319 


0.34 


1.05 (1.03-1.07) 


1.52E- 


-05 


1.12(1.09-1.16) 


3.36E- 


09 


1.45 (1.36-1.55) 


2.00E 


-14 


0.88(0.85-0.91) 


1.84E- 


18 


OBFCl 


10 


rs2487999 


105 649 816 


0.10 


1.03 (1.00-1.06) 


6.44E 


-02 


1.04(0.98-1.10) 


2.25E- 


01 


1.29(1.14-1.43) 


5.87E- 


-04 


1.06(1.02-1.11) 


1.09E- 


02 


PXK 


3 


rs6772228 


58 351 059 


0.05 


1.03 (0.98-1.07) 


2.59E 


-01 


1.02(0.93-1.10) 


6.94E- 


01 


1.04(0.82-1.25) 


7.37E- 


-01 


1.08(1.02-1.14) 


1.61E- 


02 


ZNF311 


6 


rs9257445 


29 057 185 


0.25 


1.03 (1.01-1.05) 


4.86E 


-03 


0.97 (0.93-1.02) 


2.36E- 


01 


1.05 (0.94-1.16) 


3.61E 


-01 


1.04(1.01-1.08) 


6.58E- 


03 


BCL2L1 


20 


rs6060627 


29 725 820 


0.30 


0.98(0.96-1.00) 


5.29E 


-02 


1.03 (0.99-1.07) 


1.96E- 


01 


0.93 (0.83-1.04) 


1.80E 


-01 


0.97 (0.94-1.00) 


2.62E- 


02 



Table 2 reveals that the minor allele of TERT xill26\59 is associated with significant protection from prostate cancer [OR = 0.88 (95% CI 0.85-0.91), Ptrend = 2x10 but significantly increased risks of 
breast and ovarian cancers, particularly of LMP ovarian cancer [OR — 1 .45 (95% CI 1 .36- 1 .55), Ptrend = 2 x 10" '*]. We have double checked that this is not an allele-calling aitefact and we also note that the 
minor allele of a nearby SNP, rs40 1681, has similarly been reported to be associated with increased risks of cancers of the lung, bladder, testes, cervix and basal cell carcinoma, but with decreased risk of melanoma 
(37-39). These inverted associations may be due to tissue-specific interactions that need fiirther examination. 



Table 3. The associations with TL and cancer risk of three iCOGS SNPs that are surrogates for additional peaks of TL association found by Codd et al. (26) 



Region 


Chr 


Codd et al. 


Chr Position 


iCOGS 


Chr Position 


Pairwise 


TL association 


Breast cancer 


Ovarian cancer association 


Prostate cancer 






SNP 


(Build 36) 


SNP 


(Build 36) 


between SNPs 




association 






association 
















CCHS, CPGS and 


Overall risk 


Serous invasive 


Serous LMP 


Overall risk 
















SEARCH 


(BCAC) 


(OCAC) 


(OCAC) 


(PRACTICAL) 
















P 

^ trend 


P 

trend 


p 

^ trend 


P 

^ trend 


P 

trend 


ACYP2 


2 


rsl 1125529 


54 329 370 


rsl0165485 


54 335 140 


0.98 


2.4E-05 


3.9E-01 


8.2E-01 


9.0E-02 


6.7E-01 


NAFl 


4 


rs7675998 


164 227 270 


rs2320615 


164 289 399 


0.89 


3.3E-04 


7.6E-01 


4.7E-01 


4.4E-01 


4.3E-01 


RTELl 


20 


rs755017 


61 892 066 


rs2738783 


61 779 056 


1.00 


4.5E-04 


1.6E-01 


2.1E-01 


2.7E-02 


3.2E-04 



Ptrends for per-allclc differences in TL and cancer risk are calculated as for Table 1 . 
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75lh 

o 
3 

(C 
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0 & 1 2 



4 5 6 
6 SNP allele score 



10 



Numbers: 



controls; 


36 


298 


1317 


3117 


4109 


3413 


1826 


571 


109 


19 


breast cancer cases. 


29 


222 


985 


2245 


3010 


2440 


1352 


416 


75 


10 


TL(bp): 
controls: 


4439 


4444 


4365 


4255 


4175 


4087 


4060 


3955 


3985 


3708 


breast cancer cases: 


4295 


4196 


4159 


4141 


4071 


4025 


4015 


3915 


3836 


3424 


ATL (bD): 






















controls: 


0 


5 


-74 


-184 


-264 


-351 


-379 


-484 


-454 


-731 


breast cancer cases: 


0 


-99 


-136 


-154 


-224 


-270 


-280 


-380 


-459 


-871 



Figure 2. The cumulative effect of the top six SNPs on TL. Mean TL and associated 95% CI in base pairs (bp) is illustrated corresponding to the number of ' shorter 
telomere' alleles carried of the six SNPs. The results for the 10 784 breast cancer cases and 14 815 controls are shown in grey and black, respectively. An additional 
I'-axis on the right-hand side of the figure marks the TL at the 25th percentile (3438 for cases, 3525 for controls), 50th (median) (3903 for cases, 4007 for controls) and 
75% percentile of length (4492 for cases, 46 1 6 for controls), with cases and controls in grey and black, respectively. Tabulated below the graph is the distribution of the 
six SNP 'shorter telomere' allele scores among 14 815 controlsbyfractionof total number of individuals (numbers), the mean TL by six SNP allele score (TL) and the 
difference in TL compared with the lowest allele score group (Oand 1) (ATL). There were no individuals present with an allele scoreof 1 1 or 12. Data for breast cancer 
cases are again shown in grey and control data in black. 



control individuals: those carrying 10 'shorter telomere' alleles 
have TL on average 73 1 bp shorter than individuals with one or 
fewer 'shorter telomere' alleles. Since TL decreases by ~20 bp 
per year of age ( 11 ,28), this is equivalent in magnitude to an age 
difference of 37 years respectively for these few individuals at 
the two extremes of the distribution (Fig. 2). Correspondingly, 
there is a mean 410 bp difference in TL between the controls 
(9%) with three shorter telomere alleles and those (4%) with 
eight shorter telomere alleles — equivalent to a > 20 year age dif- 
ference. 

The first of our study aims — to find genetic variants that are 
strongly associated with TL — has been successful, and we 
have therefore been able to continue to the second aim — to 
examine whether any of these TL loci also have additional 
effects on cancer risk in the hormone-related cancer data avail- 
able to us. When examined individually, only one of the top 
SNPs {TERT SNP rs7726159) displays highly significant asso- 
ciations with cancer risk (Table 2). This complex locus has 
already been the in-depth subject of two fine-scale mapping 
studies (28,40), and both these studies demonstrated that most 
of the TERT variants associated with TL and those associated 
with the strongest cancer risks are quite distinct. Although 
SNP rs7726 159 may have a direct functional effect on telomere 
maintenance, its association with cancer risk is more probably 
due to its linkage disequilibrium (correlation) with two other 
SNPs (rs2242652 and rsl0069690), which do have likely direct- 
ly functional effects on cancer risk (28,40). The only other 
reported TL loci displaying significant evidence of association 



with hormone-related cancers are OBFCl SNP rs2487999 
with serous LMP ovarian cancer [OR = 1.29 (95% CI 1.14- 
1.43); Arend = 6 X lO""*; Table 2], and RTELl SNP rs2738783 
with prostate cancer risk, [OR = 0.94 (95% CI 0.90-0.97); 
Arend = 3 X 10~^; Table 3]. Further investigation may be 
merited in these two cases. The remaining TL-associated SNPs 
have negligible effects on cancer risk and thus support the 
findings from prospective studies on TL and future cancer risk 
(13,18). 

In conclusion, this study confirms the genetic control of TL by 
common SNPs in at least six loci. None of the SNPs at these loci 
are likely to have substantial direct effects on risk of hormone- 
related cancer. This is further evidence refuting the hypothesis 
that shorter mean TL in blood cells is a major causal factor in 
breast, ovarian or prostate cancer development. 



MATERIALS AND METHODS 

GWAS and SNP selection 

Three individual GWAS were performed to look for variants 
associated with TL in three studies of disease-free individuals 
from the East Anglian region of Britain; men from the ProtecT 
prostate cancer case-control study (34) (« = 1148), women 
from the Sisters in Breast Screening (SIBS) study (41) (http:// 
ccge.medschl.cam.ac.uk/research/local/sibs-study/) {n = 796) 
and participants in the European Prospective Investigation 
into Cancer (EPIC-Norfolk) (http://www.srl.cam.ac.uk/epic/) 
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{n = 296). All the three studies had mean TL determined by the 
same quantitative real-time PCR (13) and genome- wide SNP 
genotypes determined using varying platforms (see Supplemen- 
tary Material). The ProtecT GWAS was carried out using a set of 
522 098 SNPs (after quality control (QC)) genotyped on the Illu- 
mma Infinium^'^ platform. The SIBS GWAS' was performed 
using a set of 255 05 1 SNPs (after QC), genotyped on the Illu- 
mina HumanCytoSNP- 1 2^^ chip. The EPIC GWAS genotyping 
was performed using a custom high-density array by Perlegen 
Sciences Inc. and after QC, 187 508 SNPs were included in 
further analyses. From all these datasets, the genotypes of 2 
448 093 SNPs with minor allele frequency (MAP) >0.01 were 
imputed using MaCH in all 2240 participants and a meta- 
analysis of the three studies was carried out using ProbABEL 
(see Supplementary Material). From this, we identified and 
selected 1629 potentially TL-associated variants (Arend < 
lO""*) for inclusion on the iCOGs chip. Of these, 1558 passed 
QC and were included in the final analyses. 

Associations witli TL 

Among the participating COGS studies, the TL had been deter- 
mined in cases and controls from the UK Studies of Epidemi- 
ology and Risk Factors in Cancer Heredity (SEARCH) study 
(13,28) (http://ccge.medschl.cam.ac.uk/search/) controls from 
the Danish Copenhagen City Heart study (28,42,43) (CCHS) 
and breast cancer cases from the Copenhagen General Popula- 
tion Study (18,44) (CGPS). TL was measured using a real-time 
PCR methodology as described elsewhere (11-13,45,46) and 
a composite variable, putting the SEARCH, CCHS and CGPS 
data onto the same scale, was used for all analyses (28) (see Sup- 
plementary Material). All 187 647 SNPs with MAF >0.02 on 
the iCOGS chip that passed QC criteria were tested for associ- 
ation with TL in these subjects. Initially, analysis was confined 
to the healthy control subjects within each study (SEARCH 
females, « = 6766; CCHS females, « = 4537E and CCHS 
male, n = 3762). A total of 272 SNPs were associated with 
TL, at Ptrend < 10~^, in these control subjects (Supplementary 
Material, Table SI). All associations are consistent with a 
log-additive model. 

In an exploratory attempt to increase the available sample size 
and thus the study power of this study, the above analysis was 
repeated using breast cancer cases as well as the healthy controls 
from these cancer studies. Therefore, SEARCH cases (n = 
8210) and CGPS cases {n = 2814) were also included in the ana- 
lysis. Of the 272 variants with per-allele Arend < 10~^ in the 
control-only analysis, 86 variants increased in significance 
upon inclusion of TL data from these breast cancer cases in 
our analysis (Supplementary Material, Table SI). Among 
these were SNPs within all the loci previously and independently 
reported to be associated with TL; therefore, we feel confident 
that the inclusion of cancer cases is a valid way of increasing 
the study power. 
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